Minimax Q-learning control for linear systems using the Wasserstein metric
نویسندگان
چکیده
Stochastic optimal control usually requires an explicit dynamical model with probability distributions, which are difficult to obtain in practice. In this work, we consider the linear quadratic regulator (LQR) problem of unknown systems and adopt a Wasserstein penalty address distribution uncertainty additive stochastic disturbances. By constructing equivalent deterministic game penalized LQR problem, propose Q-learning method convergence guarantees learn minimax controller.
منابع مشابه
P14: Anxiety Control Using Q-Learning
Anxiety disorders are the most common reasons for referring to specialized clinics. If the response to stress changed, anxiety can be greatly controlled. The most obvious effect of stress occurs on circulatory system especially through sweating. the electrical conductivity of skin or in other words Galvanic Skin Response (GSR) which is dependent on stress level is used; beside this parameter pe...
متن کاملPerformance and robustness analysis of stochastic jump linear systems using Wasserstein metric
This paper focuses on the performance and the robustness analysis of stochastic jump linear systems. The realization of the state trajectory under stochastic jump processes becomes random variables, which brings forth the probability distributions for the system state. Therefore, a proper metric is necessary to measure the system performance with respect to stochastic switching. In this perspec...
متن کاملMinimax Statistical Learning and Domain Adaptation with Wasserstein Distances
As opposed to standard empirical risk minimization (ERM), distributionally robust optimization aims to minimize the worst-case risk over a larger ambiguity set containing the original empirical distribution of the training data. In this work, we describe a minimax framework for statistical learning with ambiguity sets given by balls in Wasserstein space. In particular, we prove a generalization...
متن کاملp14: anxiety control using q-learning
anxiety disorders are the most common reasons for referring to specialized clinics. if the response to stress changed, anxiety can be greatly controlled. the most obvious effect of stress occurs on circulatory system especially through sweating. the electrical conductivity of skin or in other words galvanic skin response (gsr) which is dependent on stress level is used; beside this parameter pe...
متن کاملDynamic clustering of histograms using Wasserstein metric
In the present paper we present a new distance, based on the Wasserstein metric, in order to cluster a set of data described by distributions with finite continue support. The proposed distance allows to define a measure of inertia of data with respect a barycenter that satisfies the Huygens theorem of decomposition of inertia. Thus, this measure is proposed as allocation function in the dynami...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Automatica
سال: 2023
ISSN: ['1873-2836', '0005-1098']
DOI: https://doi.org/10.1016/j.automatica.2022.110850